- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources3
- Resource Type
-
0002100000000000
- More
- Availability
-
12
- Author / Contributor
- Filter by Author / Creator
-
-
Sung, Yoo Yeon (3)
-
Boyd-Graber, Jordan Lee (2)
-
Fleisig, Eve (2)
-
Bandyopadhyay, Saptarashmi (1)
-
Boyd-Graber, Jordan (1)
-
Chandra, Abhranil (1)
-
Gor, Maharshi (1)
-
Hope, Yu (1)
-
Kabir, Tasnim (1)
-
Mondal, Ishani (1)
-
Upadhyay, Ishan (1)
-
Zou, Hao (1)
-
#Tyler Phillips, Kenneth E. (0)
-
#Willis, Ciara (0)
-
& Abreu-Ramos, E. D. (0)
-
& Abramson, C. I. (0)
-
& Abreu-Ramos, E. D. (0)
-
& Adams, S.G. (0)
-
& Ahmed, K. (0)
-
& Ahmed, Khadija. (0)
-
- Filter by Editor
-
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
(submitted - in Review for IEEE ICASSP-2024) (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
As AI use becomes more common, it's important to measure not just whether the systems are correct but whether they know when they're incorrect. We propose a new metric to measure this mismatch between correctness and confidence, compare computer ability with human ability, and show that computers have a long way to go before they're well-calibrated.more » « lessFree, publicly-accessible full text available July 27, 2026
-
Sung, Yoo Yeon; Gor, Maharshi; Fleisig, Eve; Mondal, Ishani; Boyd-Graber, Jordan Lee (, Association for Computational Linguistics)Free, publicly-accessible full text available January 1, 2026
-
Kabir, Tasnim; Sung, Yoo Yeon; Bandyopadhyay, Saptarashmi; Zou, Hao; Chandra, Abhranil; Boyd-Graber, Jordan Lee (, Association for Computational Linguistics)Many of the questions for training AIs how to answer questions come from the queries users type into search engines (like Google's Natural Questions). Is there a cheaper---perhaps even better---way? We propose a "naturalization" technique to turn high-quality, rigorously edited trivia questions into examples that resemble Natural Questions. Training on our naturalized questions and testing on natural questions comes close to the results with using Natural Questions, and we can improve results on MMLU (a standard modern evaluation set) by using our data.more » « less
An official website of the United States government
